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Abstract 


We  consider  dynamic  electromagnetic  evasion-interrogation  games  in  which  the 
evader  can  use  ferroelectric  material  coatings  to  attempt  to  avoid  detection  while  the 
interrogator  can  manipulate  the  interrogating  frequencies  to  enhance  detection.  The 
resulting  problem  is  formulated  as  a  two-player  zero-sum  dynamic  differential  game  in 
which  the  cost  functional  is  based  on  the  expected  value  of  the  intensity  of  the  reflected 
signal.  We  show  that  there  exists  a  saddle  point  for  the  relaxed  form  of  this  dynamic 
differential  game  in  which  the  relaxed  controls  appear  bilinearly  in  the  dynamics  gov¬ 
erned  by  a  partial  differential  equation.  We  also  present  a  computational  framework  for 
construction  of  approximate  saddle  point  strategies  in  feedback  form  for  a  special  case 
of  this  relaxed  differential  game  with  strategies  and  payoff  in  the  sense  of  Berkovitz. 


AMS  subject  classifications:  83C50,  91A23,  49N70,  49N90,  65M32,  68T37,  60J70. 


Key  Words:  Electromagnetic  evasion-pursuit,  uncertainty,  differential  games,  theory  and 
approximation,  backward  Kolmogorov  equations. 


1 


Report  Documentation  Page 


Form  Approved 
OMB  No.  0704-0188 


Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 
VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 


1.  REPORT  DATE 

21  FEB  2011 


2.  REPORT  TYPE 


4.  TITLE  AND  SUBTITLE 

A  Zero-Sum  Electromagnetic  Evader-Interrogator  Differential  Game 
with  Uncertainty 

6.  AUTHOR(S) 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

North  Carolina  State  University, Center  for  Research  in  Scientific 
Computation, Department  of  Mathematics, Raleigh, NC, 27695-8212 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 


3.  DATES  COVERED 

00-00-2011  to  00-00-2011 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 

CRSC-TR11-04 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 


12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

We  consider  dynamic  electromagnetic  evasion-interrogation  games  in  which  the  evader  can  use 
ferroelectric  material  coatings  to  attempt  to  avoid  detection  while  the  interrogator  can  manipulate  the 
interrogating  frequencies  to  enhance  detection.  The  resulting  problem  is  formulated  as  a  two-player 
zero-sum  dynamic  differential  game  in  which  the  cost  functional  is  based  on  the  expected  value  of  the 
intensity  of  the  reflected  signal.  We  show  that  there  exists  a  saddle  point  for  the  relaxed  form  of  this 
dynamic  differential  game  in  which  the  relaxed  controls  appear  bilinearly  in  the  dynamics  governed  by  a 
partial  differential  equation.  We  also  present  a  computational  framework  for  construction  of  approximate 
saddle  point  strategies  in  feedback  form  for  a  special  case  of  this  relaxed  differential  game  with  strategies 
and  payoff  in  the  sense  of  Berkovitz. 

15.  SUBJECT  TERMS 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 

18.  NUMBER 

19a.  NAME  OF 

ABSTRACT 

OF  PAGES 

RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Same  as 
Report  (SAR) 

32 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


1  Introduction 


In  an  electromagnetic  evasion-interrogation  game,  the  evader  wishes  to  minimize  the  in¬ 
tensity  of  the  reflected  signal  to  remain  undetected  in  carrying  out  his  mission  while  the 
interrogator  wishes  to  maximize  the  intensity  of  the  reflected  signal  to  detect  the  attacker. 
The  results  in  [5]  demonstrated  that  it  is  possible  to  design  ferroelectric/ferromagnetic  ma¬ 
terials  with  appropriate  dielectric  permittivity  and  magnetic  permeability  to  significantly 
attenuate  reflections  of  known  electromagnetic  interrogation  signals  from  highly  conductive 
targets  such  as  airfoils  and  missiles.  However  the  results  in  [6]  showed  that  if  the  evader 
employed  a  counter  interrogation  design  based  on  a  fixed  set  of  known  interrogating  frequen¬ 
cies,  then  by  a  rather  simple  counter- counter  interrogation  strategy  (use  of  an  interrogating 
frequency  little  more  than  10%  different  from  the  assumed  evader  design  frequencies),  the 
interrogator  can  easily  defeat  the  evader’s  material  coatings  counter  interrogation  strategy 
to  obtain  strong  reflected  signals.  Thus,  one  can  readily  conclude  from  these  two  results  that 
the  evader  and  the  interrogator  must  each  try  to  confuse  the  other  by  introducing  significant 
uncertainty  in  their  design  and  interrogating  strategies,  respectively. 

Based  on  this  consideration,  a  static  electromagnetic  evasion-interrogation  game  (in  the 
spirit  of  mixed  strategies  introduced  by  von  Neumann  [37])  was  considered  in  [2],  where  the 
problem  is  mathematically  formulated  as  a  minimax  game  over  sets  of  probability  measures 
taken  with  the  Prohorov  metric.  In  this  case  this  is  equivalent  to  the  weak  star  topology 
for  the  set  of  probability  measures  considered  as  a  subset  of  the  dual  C*  of  C,  the  bounded 
continuous  functions  with  the  supremum  norm.  In  this  formulation,  the  evader  does  not 
choose  a  single  coating,  but  rather  has  a  set  of  possibilities  available  for  choice  and  only 
chooses  the  probabilities  with  which  he  will  employ  the  materials  on  a  target.  By  choosing 
his  coatings  randomly  (according  to  a  best  strategy  to  be  determined  in  a  minimax  game), 
he  prevents  adversaries  from  discovering  which  coating  he  will  use  -  indeed,  even  he  does  not 
know  which  coating  will  be  chosen  for  a  given  target.  The  interrogator,  in  a  similar  approach, 
determines  best  probabilities  for  choices  of  frequency  and  angle  in  the  interrogating  signals. 
Using  compactness  and  approximation  properties  in  the  context  of  the  Prohorov  metric,  the 
authors  in  [2]  present  a  rather  complete  theoretical  and  computational  framework  for  these 
static  problems.  A  more  realistic  (for  some  scenarios)  dynamic  setting  is  initially  introduced 
in  [3]  by  consideration  of  time  dynamics  in  the  problem,  wherein  the  evader  is  allowed  to 
make  dynamic  changes  to  his  strategies  in  response  to  the  dynamic  input  information  with 
uncertainty  on  the  interrogator’s  actions. 

In  this  paper,  we  consider  a  two  player  zero-sum  differential  game  in  an  infinite-dimensional 
space,  where  the  cost  functional  is  based  on  the  intensity  of  reflected  signals.  In  this  for¬ 
mulation,  both  evader  and  interrogator  choose  a  probability  measure  at  each  time  t  in  the 
presence  of  material  uncertainty  which  is  modeled  as  a  stochastic  process.  The  outline  of 
this  paper  is  as  follows.  In  Section  2  we  present  a  description  of  our  problem  formulation 
and  show  that  there  exists  a  saddle  point  for  the  resulting  relaxed  differential  game.  Then 
in  Section  3  we  present  a  computational  framework  for  construction  of  approximate  sacldle- 
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point  strategies  in  feedback  form  for  a  special  case  of  this  differential  game.  Some  summary 
remarks  are  given  in  Section  4. 


2  Problem  Formulation  and  Saddle  Points  for  the  Re¬ 
laxed  Differential  Game 


The  cost  functional  is  based  on  the  intensity  of  reflected  signals  from  an  object  such  as 
an  airfoil  or  missile  coated  by  a  radar  absorbent  material  of  constant  thickness.  There  are 
several  ways  to  treat  the  electromagnetic  scattering  [5,  6].  One  fundamental  approach  is  to 
employ  the  far  field  pattern  for  reflected  waves  computed  directly  using  Maxwell’s  equations. 
As  detailed  in  [2] ,  in  two  dimensions  for  a  reflecting  body  with  a  given  coating  layer  with  an 
interrogating  plane  wave  E^\  the  scattered  field  E ^  satisfies  the  Helmholtz  equation  [12]. 
An  alternative  and  much  less  computationally  expensive  one  (as  well  as  equally  accurate  in 
this  setting  -  see  [5,  6])  is  to  calculate  the  reflection  coefficient  based  on  a  simple  planar 
geometry  (e.g.,  see  Fig.  1)  with  Fresnel’s  formula  for  a  perfectly  conducting  half  plane. 


C’ 


Figure  1:  Interrogating  high  frequency  wave  impinging  (angle  of  incidence  0)  on  coated 
(thickness  d)  perfectly  conducting  surface 


We  will  use  the  reflection  coefficient  to  measure  the  strength  of  backscattering.  We  assume 
that  a  normally  incident  electromagnetic  wave  with  the  angular  frequency  a ;  is  assumed  to 
impinge  the  half  plane.  Then  the  corresponding  wave  length  in  the  air  is  2hc/uj,  where  the 
speed  of  light  is  c  =  3  x  108.  Thus,  the  reflection  coefficient  R  for  a  wave  impinging  on 
a  coating  layer  of  thickness  d  with  relative  dielectric  permittivity  e  and  relative  magnetic 
permeability  /j,  is  given  by 

R(H,  e,  u,  d)  =  r^+  r2  ,  (2.1) 

1  +  rir2 
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where 


and  r2  =  exp  ( 2iy/ejlujd/c ) . 


(2.2) 


£  -  y/efi 

r'i  = - 1 — 

e  +  yT/i 


This  expression  can  be  derived  directly  from  Maxwell’s  equation  by  considering  the  ratio 
of  reflected  to  incident  waves,  for  example,  in  the  case  of  parallel  polarized  ( TEX )  incident 
wave  (e.g.,  see  [5,  23]). 


Control  of  reflections  by  the  evader  is  effected  via  local  currents  in  a  composite  layered  reflec¬ 
tor  device  that  can  be  used  to  control  the  dielectric  permittivity  and  magnetic  permeability 
in  a  target  coating  layer  as  discussed  above  and  in  more  detail  in  [5].  The  reflector  contains 
a  ferrite  layer  and  a  ferroelectric  layer  as  constituents.  The  key  element  of  the  device  is  that 
the  material  properties  /i  =  n(H)  and  e  =  e(E)  of  the  composite  layers  are  controllable  in 
terms  of  the  magnetic  mean  and  the  electric  mean  in  the  layers,  and  thus  can  support  agile 
frequency  attenuation.  Control  is  implemented  via  local  circuits  which  can  produce  rapidly 
changing  E  fields.  Since  the  E  and  H  fields  are  connected  via  Maxwell’s  equations,  if  the 
evader  controls  the  dielectric  permittivity  e  via  these  local  E  fields,  this  also  produces  rapid 
changes  in  the  magnetic  permeability  /x. 


For  our  formulation  we  assume  that  the  evader  “controls”  dielectric  permittivity  of  the 
surface  coatings  by  choosing  parameters  £  =  Re(e)  from  a  compact  admissible  set  £  C  M+ 
in  a  measurable  (i.e.,  t  — >  e{t)  is  a  measurable  function)  time  dependent  manner.  (Here 
M+  denotes  the  set  of  non- negative  real  numbers.)  This  produces  changes  in  the  magnetic 
permeability  which  for  our  initial  formulation  here  we  assume  incorporates  uncertainty  into 
the  reflected  signal.  For  simplicity,  we  assume  the  real  part  x  of  the  magnetic  permeability 
H  =  x  +  i/ij  of  the  coating  has  uncertainty  described  by  an  Ito  diffusion  process  Xt  satisfying 
the  stochastic  differential  equation 


dXt  =  b(Xt)dt  +  a(Xt)dWt.  (2.3) 

Here  Wt  denotes  the  standard  Brownian  motion  and  both  b  =  b(x)  (the  mean  rate  of  change 
for  x  =  i?e(/x))  and  o  are  non-random  functions  that  are  assumed  to  be  Lipschitz  continuous. 
In  addition,  we  assume  that  the  interrogator  has  control  of  the  frequency  to  of  the  interro¬ 
gating  electromagnetic  signals.  At  each  time  t  G  [ t0,T ]  (t0  >  0),  the  interrogator  chooses 
parameters  to  from  a  compact  admissible  set  C  M+. 

We  now  can  readily  formulate  our  problem  as  a  zero-sum  differential  game,  where  the  cost 
functional  is  dependent  on  the  expected  value  of  the  intensity  of  the  reflected  signal. 
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2.1  Evolution  of  the  Expected  Value  of  Intensity  of  Reflected  Sig¬ 
nal  and  a  Dynamic  Differential  Game 


Let  x(xXY)  =  | R(x  +  ifj*i,£  +  i€i,u,d)\,  where  /i,  and  e*  denote  the  imaginary  parts  of  n 
and  e,  respectively,  which  are  assumed  fixed  throughout  this  presentation.  We  then  define 


v(t,x)  =  E;t: 


s  to)x(Xs,e(s),a;(s))ds  +  uo(V) 
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where  Ex[  •  ]  denotes  the  expectation  with  respect  to  the  probability  law  of  {Xt  :  t  >  f0} 
when  its  initial  value  is  X '(to)  =  x,  A  >  0  is  a  discount  parameter,  and  v0  is  a  nonnegative 
function  that  is  used  to  denote  the  initial  (t  =  t0)  intensity  of  reflected  signal. 


Following  a  standard  technique  for  treating  integrals  (see  Section  10.3  of  [31]),  we  next  define 
the  Ito  diffusion  Yt  in  M2  by 


dYt  =  d 


Xt 

Zt 


b{Xt) 

A  ex^x(Xt,e(t),u(t)) 


dt  -\- 


<xt) 

0 


dWt 


Let  g(t,x,z )  =  E [Zt  +  vo(Xt)  \  Y(to )  =  (x,  z)T],  where  E[  ■  |  •  ]  denotes  the  conditional 
expectation.  Then  we  have 

v(t,x)  =  g(t,x,0). 

Here  the  generator  of  the  Ito  diffusion  process  {Yt  :  t  >  to}  is 

did2  d 

Ccp(x,z)  =  b(x)  —c/>(x,z)  +  -a2(x)  —<j>{x,z)  +  XeHt-to)x(x,£(t),uj(t))—  </>(x,z). 

It  then  follows  from  Section  8.1  in  [31]  that  g  satisfies  the  backward  Kolmogorov  e quation 


d 

Qj_g  =  £g,  g{to,x,z)  =  z  +  v0(x). 


(2.4) 


A  discussion  of  the  relationship  between  this  state  and  the  semigroup  generated  by  C  can 
be  found  in  [15]. 


Since  g  =  v  +  z  is  the  solution  to  (2.4),  it  follows  that  v  satisfies 
d  d  1  d2 

-Qt Ht,x)  =  b(x)  +  2a2(x)  +  A eMt~to)x(x,£(t),u(t)), 

v(t0,x)  =  vQ(x). 

Now  let  v(t,x)  =  e~x^~to^v(t,x).  It  is  easy  to  show  that  v  satisfies 
d  d  i  d2 

— u(t,a;)  =  b(x)  -^v{t,x)  +  ^\x)  —v(t,x)  -  A v(t,x)  +  Ay(a:,e(i),a;(t))t 
v(t0,x)  =  v0(x). 
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We  note  that  the  state  v  in  this  formulation  is 


v(t,  x )  =  E;£ 


Ae  x(t  s)x(Xs,e(s),uj(s))  ds  +  e  x(t  to)v0(Xt) 


\_J  tQ 


the  expected  value  of  a  measure  of  the  reflected  intensity. 


We  restrict  x  to  be  in  a  finite  interval  [x,  x] ,  and  set  the  boundary  conditions  to  be  zero. 
Thus  we  will  consider  the  state  equation 


d  d  i  9  d2 

^ v(t,x)  =  b{x)  -^v{t,x)  +  2(t2(x)  ^  Xv{t,x)  +  A  x(x,e(t),a;(t)), 

v(t,x)  =  0,  v(t,x)  =  0,  (2-5) 


v(t0,x)  =  v0(x). 

The  objective  of  the  game  for  the  evader  is  to  choose  a  strategy  such  that  the  intensity  of 
the  reflected  signal  is  as  small  as  possible  while  the  objective  for  the  interrogator  is  to  choose 
a  strategy  so  that  the  intensity  of  the  reflected  signal  is  as  large  as  possible.  Hence,  the  cost 
functional  for  a  zero-sum  differential  game  with  uncertainty  can  be  formulated  by 

pT  px 

J(e,u)=  /  /  v(t,  x;  £,uj)dxdt.  (2.6) 

J  to  J  X_ 


A  number  of  approaches  have  been  used  in  the  literature  to  study  infinite-dimensional  dif¬ 
ferential  games.  One  approach  is  based  on  the  theory  developed  by  Elliott  and  Kalton  [16] 
for  differential  games  in  Euclidean  spaces.  For  example,  an  infinite-dimensional  differential 
game  on  the  infinite  horizon  was  studied  in  [24]  with  strategies  in  the  sense  of  Elliott  and 
Kalton,  and  the  value  function  of  the  differential  game  is  characterized  as  the  unique  vis¬ 
cosity  solution  of  the  Hamilton-Jacobin-Isaacs  equation.  The  other  approach  is  based  on 
the  theory  developed  by  Berkovitz  [7]  for  differential  games  in  Euclidean  spaces,  wherein 
the  definition  of  strategy  is  a  combination  of  “K  strategies”  discussed  by  Isaacs  [22]  and 
Friedman’s  lower  strategy  (e.g.,  see  [19,  20])  and  the  definition  of  payoff  and  saddle  point 
follows  that  of  Krasovskii  and  Subbotin  [25].  For  example,  infinite-dimensional  differential 
games  with  strategies  and  payoff  in  the  sense  of  Berkovitz  were  studied  in  [21]  and  [35]  for 
finite  horizon  and  infinite  horizon,  respectively,  and  the  value  function  is  characterized  as 
the  unique  viscosity  solution  of  Hamilton- Jacobi-Isaacs  equation.  It  should  be  noted  that 
the  principle  result  of  all  of  these  investigations  is  that  if  the  so-called  Isaacs  condition  holds 
then  the  differential  game  has  a  value.  The  interested  readers  can  refer  to  [34]  for  a  readable 
short  review  on  the  history  of  differential  games. 

For  the  game  that  we  present  here,  the  Isaacs  condition  does  not  hold  as  the  function 
4>(x)x(x,£,uj)dx  are  in  general  not  quasiconvex  in  £  (e  £)  and  quasiconcave  in  u  (e  f!) 
for  any  (j)  E  L2(x,x).  In  other  words,  our  game  may  not  have  a  value.  A  common  approach 


6 


that  is  used  to  circumvent  this  difficulty  is  to  enlarge  the  class  of  controls  to  include  relaxed 
controls  (e.g.,  see  [17,  32,  41]).  Hence  we  will  consider  the  game  in  a  corresponding  relaxed 
form  in  the  remaining  of  this  paper. 


2.2  Relaxed  Differential  Game 

The  notion  of  relaxed  control,  or  generalized  curve,  was  introduced  into  the  calculus  of 
variations  (in  the  40’s)  and  optimal  control  (in  the  60’s)  by  a  number  of  distinguished 
contributors  such  as  Young  [42,  43],  McShane  [28,  29,  30],  Filippov  [18]  and  Warga  [38, 
39,  40].  Since  then,  it  has  been  studied  by  many  other  researchers  (e.g.,  see  [1,  10,  11,  27]). 

Before  we  give  the  relaxed  forms  for  (2.5)  and  (2.6),  we  will  introduce  needed  theoretical 
background  information  on  relaxed  controls  (e.g.,  see  [17,  39,  40]).  Let  C(fi)  and  C{£) 
denote  the  spaces  of  continuous  functions  equipped  with  usual  supremum  norm,  and  C*(fi) 
and  C*{£)  be  their  corresponding  topological  dual  spaces  taken  with  the  weak  star  topology 
which  is  equivalent  to  the  Prohorov  metric  topology  [9,  33]  used  in  the  static  games  in  [2],  We 
define  the  spaces  b^(fi)  and  £?{£)  as  the  spaces  of  all  regular  probability  measures  defined 
on  the  Borel  subsets  of  fi  and  £,  respectively.  Then  with  the  Prohorov  metric,  b^(fi)  and 
£P(£)  are  compact  and  convex  subsets  of  C*(fi)  and  C*(£),  respectively.  In  addition,  as 
noted  above  convergence  in  the  Prohorov  metric  is  equivalent  to  weak  star  convergence.  For 
more  information  on  Prohorov  metric,  the  interested  readers  can  refer  to  [9,  33]. 

Let  L1  (t0,  T;  C(fi))  be  the  Banach  space  of  Lebesgue  integrable  functions  from  [ t0,T ]  to 
(7(11)  with  the  norm 

\\gu\\Li(t0,T-,c(n))  =  /  \\guj(t)\\c(n)dt. 

Jt0 

The  Banach  space  L1  (t0,  T;  C(£))  and  its  norm  is  similarly  defined,  ft  is  known  that 
both  L1  (t0,T;  C(£l))  and  L1  (t0,  T;  C(£))  are  separable.  We  denote  the  topological  dual 
of  L1(f, o,  T ;  C(fi))  and  Ll{t 0,  T ;  C(£))  by  L1^,  T ;  C(fi))*  and  L1^,  T ;  C (£))*,  respectively. 
By  the  Dunford-Pettis  theorem  (e.g.,  see  [40,  Theorem  1V.1.8]),  we  have  the  equivalence  that 

L 1  (t0,T;C(n))*  ^  L°°(t0,T;  C*(ty) 

and 

L 1  (t0,T-C(£))*  C*(£)). 

Here  L°°(to,T ;  C*(Q))  is  a  Banach  space  of  essentially  bounded  measurable  functions  from 
[to,T]  to  C*(fl)  with  the  norm 

||$w|U~(to,T;C*(n))  =  ess  sup  I  ^(f)  |(H). 

t(E.[to,T] 

The  Banach  space  L°°(to,  T\C*(£)))  and  its  norm  is  similarly  defined.  However,  in  this  paper 
we  shall  consider  L°°{t 0,T;  C*(£t))  and  L°°(t o,T;  C*{£))  taken  with  the  weak  star  topology. 
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A  sequence  {dq^}  in  L°°{t0 ,  T ;  C*(f2))  is  said  to  be  convergent  in  this  topology  if  there  exists 
a  point  dq  G  L°°(t0,T;  C*(f2))  such  that  for  any  gu  G  Ll{t 0,T;  C(f2))  we  have 


lim 

j->o o 


gu(t,uj)$Utj(t)(du)dt 


gu(t,u)^LJ(t)(duj)dt. 


'to  J  ft 


The  convergence  of  a  sequence  in  L°°(t0,T;  C*(£))  with  the  weak  star  topology  is  similarly 
defined. 


A  relaxed  control  for  the  interrogator  is  a  mapping  dq  :  [to,T]  — >  and  this  mapping 

is  measurable  (respectively,  continuous)  if  /  hw(u>)<!>w(t)(du>)  is  measurable  (respectively, 

Jn 

continuous)  function  of  t  G  [f0,  T]  for  every  continuous  real-valued  function  /q  on  Q.  A 
relaxed  control  for  the  evader  dq  :  [t0 ,  T ]  — >  is  dehned  similarly.  We  shall  identify 

these  controls  which  differ  only  on  a  set  of  measure  zero.  Let 

38(12)  =  {dq  |  dq  :  [to,T\  — >  3^(12)  is  measurable}. 

and 

38(£)  =  {dq  |  dq  :  [t0,T]  — >  38(£)  is  measurable}. 

Let  38q  and  3Se  denote  the  unit  ball  of  L°°(f0,T;  C*(fi))  and  L°°(t0,T ;  C*(£)),  respectively. 
That  is, 

38n  =  {$e  L°°(to,T]  C*(£2))  |  ||$|Uoc(t0,T;cqn))  <  1} 

and 

8$e  =  G  L°°(t0,T;C*(£ ))  |  ||<L||L°°(t0,T;C*(£:))  <  !}• 

Then  the  weak  norm  topology  and  weak  star  topology  of  3§q,  (respectively,  38s)  coincide,  and 
with  this  topology  38q  (respectively,  38 z)  is  a  compact  metric  space  (see  [40,  Theorem  1.3.11 
and  Theorem  1.3.12]).  Note  that  for  any  dq  G  38(12)  and  dq  G  38(£)  we  have  dq(f)(fi)  =  1 
and  dq(f)(£)  =  1.  Hence,  3% (fi)  C  38q  and  38(£)  C  38 e.  In  addition,  we  have  the  following 
important  results. 

Theorem  2.1.  (See  [40,  IV. 2.1]  or  [17,  Theorem  3.9])  The  sets  38(32)  and  38(£)  can  be 
considered  as  closed  convex  subsets  of  the  unit  ball  ofL°°(t0,  T;  C*(f2))  and  L°° (f0,T;  C* (£)) , 
respectively,  so  with  the  weak  star  topology  both  38(32)  and  38(£)  are  compact. 


Let  dq  G  38(32)  and  dq  G  38(£).  Then  by  Lemma  3.13  in  [17]  we  know  that  dq  x  dq  is 
a  measurable  relaxed  control  on  £  x  32,  and  dq  x  dq  can  be  considered  to  belong  to  the 
unit  sphere  of  the  topological  dual  L°°(to,T-,C*(£  x  fi))  of  id^o,  T;  (7(£  x  fl)).  With  this 
background  information  on  relaxed  controls,  we  can  now  reformulate  the  state  equation  (2.5) 
in  relaxed  control  form 

d  d  i  d2 

—v(t,  X )  =  b(x)  x)  +  2a2^  u(*> _  x )  +  /(*»  X)i 

v(t,  x)  =  0,  v(t,  x )  =  0, 
v{to,x )  =  v0(x), 
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(2.7) 


where 


f(t,x)  =  X  /  x(x,£,u)^£(t)(d£)  $u(t)(du). 
Jn  Js 


The  cost  functional  corresponding  to  the  relaxed  controls  <Le  and  <3?^  is  defined  by 

pT  px 

J($£,$u)  =  /  /  v(t,x;^£,^U})dxdt. 

J  to  J  X_ 


(2.8) 


(2,9) 


Hence,  for  this  relaxed  formulation  (2.9)  with  (2.7),  the  evader  does  not  choose  a  single 
coating  at  each  time  t ,  but  rather  has  a  set  of  possibilities  available  for  choices.  The  inter¬ 
rogator,  in  a  similar  approach,  determines  best  probabilities  for  choices  of  frequency  in  the 
interrogating  signals  at  each  time  t. 

Remark  2.2.  From  (2.1),  it  is  easy  to  see  that  x  is  continuous  on  [x,  x]  x  £  x  fb  By 
assumption  both  £  and  are  compact.  Hence,  x  is  bounded.  Let 


fe(t,x,u ) 


X(x,e,u)$e{t)(de). 


Then  by  the  Lebesgue  dominated  convergence  theorem  we  know  that  fe(t,  •,  •)  is  continuous  on 
[x,  x]  x  for  fixed  t,  and  by  the  definition  of  relaxed  controls  we  know  fe(-,  x,  uj)  is  measurable 
for  fixed  (x,u>).  In  addition,  we  have 


\fe(t,x,uj)\  <  ||x||c(|x,*]x£xn)^e(t)(f) 

(2.10) 

=  llxllcfeslxfxfi)- 

Thus,  f£  G  L°°(to,  T;  C(\x_,  x\  x  fl)),  which  implies  that  f£  e  L1(t0,  T;  C([x,  x\  x  fl)).  Note 
that 

f(t,x)  =  \  [  fe(t,x,u)$u(t)(du}). 

Hence,  f(t,  •)  is  continuous  on  [x,  x]  for  fixed  t,  and  f(-,x)  is  measurable  for  fixed  x.  Simi¬ 
larly,  we  find 

\f(t,x)\  <  A||x||cd2,*]xfxn).  (2.11) 

Thus,  f  G  L°°(t0,T]C([x,x])).  In  addition,  by  Fubini’s  theorem  we  can  exchange  the  order 
of  integration  in  (2.8). 


2.3  Existence  of  Saddle  Points  for  Relaxed  Differential  Game 


In  this  section  we  show  that  the  relaxed  form  of  the  minmax  dynamic  differential  game  for 
(2.9)  subject  to  (2.7)  has  a  saddle  point.  We  assume  that  there  exists  a  positive  constant 
ainf  such  that  a(x)  >  crinf  for  any  x  G  [x,  x] .  Let  HI  =  L2(x,x),  V  =  H(\  (x,  x),  and  denote 
the  topological  dual  space  V*  by  V*  =  H  [(x,  x).  If  we  identify  HI  with  its  topological  dual 
HI*  then  V  HI  ~  HI*  V*  forms  a  Gelfand  triple  [26].  Throughout  this  presentation 
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||  •  ||h  and  ||  •  || v  and  ||  •  ||y«  are  used  to  denote  the  norms  in  HI,  V  and  V*,  respectively,  (•,  •) 
denotes  the  inner  product  in  HI,  and  (•,  -)v*,v  represents  the  duality  paring  between  V*  and 
V.  Following  standard  conventions,  we  use  an  over  dot  (  ')  to  denote  the  derivative  with 
respect  to  the  time  variable  t,  and  use  prime  ( ' )  to  represent  the  derivative  with  respective 
to  the  space  (i.e.,  permeability)  variable  x.  In  addition,  for  convenience  we  may  use  ||  •  ||oo 
to  denote  both  the  norms  in  L°°(x,x)  and  C{\x_,x\). 


Define  the  sesquilincar  form  a  on  V  x  V  by 

«(0,  VO  =  ~(H',  VO  +  ^{<P',  (^W)  +  M<P,  VO-  (2.12) 


By  Remark  2.2,  we  know  that  /(f)  G  C{[x_,x]).  Hence,  we  may  rewrite  (3.2)  in  the  weak 
form 


(*(*),  VO  v*,v  +  a(v(t),ip)  =  (/(f),  VO  > 
v(t0)  =  Vo 


(2.13) 


for  any  ip  G  V.  Here  and  elsewhere  v(t)  and  /(f)  denote  the  functions  v(t,-)  and 
respectively. 


Theorem  2.3.  Let  vq  G  HI  and  assume  a  is  Lipschitz  continuous  with  b  G  L°°(x,x).  Then 
there  exists  a  unique  solution  v  for  (2.13)  with  v  G  H1(t0,T]V*)  fl  L2(f0,T;  V).  In  addition, 
there  exists  a  positive  constant  k  such  that  for  any  t  G  [f0,  T\ 


IM*)|||  <  «  (llvblll  +  £  ||/(a)||^ds)  , 

and 

[T\\v(t)\\lds<n(\\vo\g  +  [T  WmWUs]  . 
Jto  V  Jt.0  J 

Furthermore,  we  have  v  G  C7(f0,T;HI). 


(2.14) 


(2.15) 


Proof.  Note  that  V  is  continuously  imbedded  in  H ,  and  H  is  continuously  imbedded  in  V*. 
Hence,  there  exists  a  constant  7  >  0  such  that 


IIV’IIh  <  7ll'0l|v, 

for  any  ip  G  V, 

(2.16) 

and 

||h|  y»  <  7||h||H, 

for  any  h  G  HI. 

(2.17) 

Since  a  is  Lipshcitz  continuous,  o'  G  L°°(x,x) 

.  Thus,  by  (2.16)  and  (2.17) 

we  find  that  for 

any  0,  ip  G  V  we  have 

|a(0,0)|  <  ||&||oo||0/||h||,0||h  + -||o'2||oo||0/||h||'0/||h 

+  ||cr||oo||°‘,||oo||/),||H||'0||H  +  A||0||h||'0||h 

<  7ll&l|oo||0||v||0||v  +  ^lk2||oo||0||v||0||v 

+7lklloo|k,||oo||0||v||'0||v  +  72A||0||y||'0||y. 
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Let  q  =  7||&||oo  +  - 1| cr2 1| 00  +  7||o'||oo||°',||oo  +  72A.  Then  by  the  above  inequality  we  have 


K0;VOI  <  i? 1 1 0 1 1 v 1 1 0 1 1 v ;  for  any  0,0  G  V. 

For  any  0  G  V  we  also  obtain 

T 


(2.18) 


a(0,0)  >  (  -(Tint-  -  20  )  ||0||2 


2  +  llcrll2  Ilk  II 2 

OO  1  II  II OO  II  1 1 OO 


49 


m 


Setting  6  =  -ofnf,  we  have 


a(0,0)  +  «h||0||h  >  av||0||v, 


(2.19) 


where  ay  =  -afnf  and  am  = 


2  +  ||cr||2  Ik'll2 

OO  1  II ^  II oo II ^  Moo 

49 


By  Remark  2.2,  we  know  that  /  G 


L°°(t0,T;C([x,x])).  Hence,  /  G  L2(t0,T;  V*)-  Thus,  by  Theorem  2.1  in  [4]  we  know  that  for 
any  u0  G  HI  there  exists  a  unique  solution  v  for  (2.13)  with  v  G  iL1(t0,  T]  V*)  nL2(i0,T;¥), 
and  (2.14)  and  (2.15)  hold  for  some  positive  constant  k.  Furthermore,  v  G  C(t0,  T;  H),  and 
thus  the  initial  condition  in  (2.13)  is  meaningful.  □ 

Remark  2.4.  By  Remark  2.2,  we  know  that  f(t )  G  C(\x,x])  C  HI.  Thus,  we  can  easily 
obtain  from  (2.11) 

\\f(t)\\m<^  =  {x-x)X2\\x\\2ci^]x£Xny  (2.20) 

By  (2.14),  (2.15),  (2.17)  and  (2.20)  we  find 


and 


'to 


K0)||h  <  «  (IMIh  +  (T-  t0)72«n)  , 


|u(t)||vds  <  k  (||u0||h  +  (T  -  t0)-f2Km) 


(2.21) 


(2.22) 


From  (2.21)  and  (2.22),  we  see  that  both  ||u(0||h  and  /  ||i>(£)||yds  are  bounded  by  a  positive 

■Jto 

constant  which  is  independent  of  the  choices  of  and  40. 

Remark  2.5.  Letv(t,x)  be  the  solution  to  (2.13).  Then  by  (2.16)  and  (2.18)  we  find 
\(v(t),  0)v*,v|  =  |  -  a(u(0,0)  +  (/(*),  0)| 

<  p|kWllvll0llv  +  7||/WIIh||0||v, 

which  implies  that 

JkWHv*  =  sup  {|(u(i),0)v,v|  I  0  G  V} 

IRIIv<i 

<  p||u(0||v  +  7||/(0||h. 

By  (2.20)  and  the  above  equation,  we  obtain 

|2„  <  2p2||u(t)||y  +  272kh. 
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Thus,  by  (2.22)  and  integrating  the  above  equation  we  have 


v{t)\\l*dt  <  2q2k\\v0\\11  +  2 (T 


t0)^2 Km(Q2 K  +  l) . 


(2.23) 


From  the  above  equation  we  see  that  /  ||b(t)||y*dt  is  bounded  by  a  positive  constant  that  is 


't0 


independent  of  the  choices  of  and  <F£. 


From  the  definition  for  J  defined  in  (2.9),  to  show  J  is  separately  continuous  in  each  of 

its  variables,  it  suffices  to  show  that  for  given  <f>£  G  &{£)  and  a  sequence  {$^1  C  &(Ft) 

converging  to  in  &{Vt)  we  have 

/•T  px  pT  px 

lim  /  /  v(t,  x;  $£,  $u,i)dxdt  —  /  /  v(t,  X]  <f>£,  Q^dxdt,  (2.24) 

i->°°  Jt0  Jx  ’  Jto  Jx 

and  for  given  G  &(Ft)  and  a  sequence  {<F£J}  C  &{£)  converging  to  <f>£  in  &(£)  we  have 

pT  px  pT  px 

lim  /  /  v(t,  x;  <F£  j,  $u)dxdt  —  /  /  u(t,  $£,  (2.25) 

j->oojto  Jx  ’  Jt0  Jx 

Actually  by  using  (2.21),  (2.22)  and  (2.23)  and  similar  arguments  as  in  [11,  Lemma  2.1], 
we  can  show  that  (2.24)  and  (2.25)  both  hold.  For  convenience,  we  will  show  (2.24)  in  the 
following  lemma  (similar  arguments  can  be  used  to  establish  (2.25)). 

Lemma  2.6.  Let  <f>£  G  &(£),  and  assume  that  the  sequence  C  V)  is  convergent 

to  inM{£t).  Then  (2.24)  holds. 


Proof.  For  notational  convenience,  we  let  fj  =  /(•,•;  <F£,  (fq,,.,)  and  Vj  =  v(-,-  ;  fI>£,  4Aj)- 
By  (2.21),  (2.22)  and  (2.23)  ,  we  know  that  {vj}  is  bounded  in  C'(i0,T;EI)  and  also  in 
L2(t0,T]  V),  and  {Lj}  is  bounded  in  L2(t0,T]V*).  Thus,  there  exists  a  subsequence  -  again 
denoted  by  Vj  -  such  that 

Vj  v  weakly  in  L2(t0,  T;  V), 
i)j  v  weakly  in  L2(to,T]V*). 

Observe  that  V  is  also  compactly  imbedded  in  H .  Hence,  by  Theorem  2.1  in  [36]  we  have 

Vj  v  strong  in  L2(t0,  T ;  HI).  (2.26) 

We  further  observe  that  L2(t0,T;  HI)  is  continuously  imbedded  in  L1{to,T-1  L 1(x,x)).  Hence, 
by  (2.26)  we  know  that  Vj  is  strongly  convergent  to  v  in  L1(t0,  T;  L1^,  x)),  which  means 

pT  px  pT  px 

lim  /  /  v(t,x]$£,$UJj)dxdt=  /  /  v(t,x)dxdt. 

Jto  Jx  ’  Jt.0  Jx 
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Thus,  to  complete  the  proof  we  only  need  to  show  that  v  —  v(-,  • ;  <3>e,  <!><*,). 


Let  g(t,x )  =  rj(t)'ip(x),  where  -0  G  V,  and  rj  G  C'1(t0,T)  with  -//(to)  =  0  and  77(F)  =  0.  We  set 
v  =  Vj  in  (2.13),  and  then  multiply  (2.13)  by  77(f)  and  integrate  to  find 


f  {vj(t),ip)Y*,vrl(t)dt+  [  a{vj{t),ip)‘n{t)dt=  j  {fj{t)^)'q{t)dt. 

Jt  0  Jt  0  Jt  0 

Integrating  by  parts  for  the  first  term  of  the  above  equation,  we  have 


/  (vj(t),i{>)fi(t)dt+  /  a(vj(t),i/j)r)(t)dt=  /  (fj(t),i/>)ri(t)dt.  (2.27) 

b0  *7  tQ 


By  Fubini’s  theorem,  the  right  side  of  (2.27)  can  be  written  as 


'to 


' to  \_Jx_ 
px 


\f>{x)  (  /  fe(t,x,u)$Uj(t)(du)  )  dx 


=  /  A^(a;) 


to  J  Q 


rj(t)f£(t,x,u)^oj!j(t)(dw)dt 


rj(t)dt 

dx. 


(2.28) 


By  Remark  2.2,  we  know  that  fe  G  F°°(to,  F;  C([x,  x]  x  ft)).  Since  77  G  C'1(to,F),  we  have 
77 f£  G  L°°(t0,  F;  C([x,  x]  x  ft)),  which  implies  rjf£  G  L1(f0,  F;  (7([x,  x]  x  ft)).  Since  is 
convergent  to  in  t^(ft),  letting  j  00,  passing  to  the  limit  in  (2.28)  and  using  Fubini’s 
theorem  we  find 

(. fj(t),ip)v(t)dt=  [  (/ (t),ip)r)(t)dt. 

Jto 

Now  we  let  j  — >  00  and  pass  to  the  limit  term  by  term  in  (2.27)  to  obtain 


(v(t),  if) 77(f) dt  +  /  a(v(t),  ip) rj(t) dt 


>t0 


'to 


{f(t),ip)v(t)dt. 


Integrating  by  parts  for  the  first  term  in  the  above  equation,  we  find 


((#(f),  ^)v*,v  +  a(u(f),  ^))  v(t)dt 


(f(t),ip)v{t)dt. 


(2.29) 


Note  that  the  class  of  rj’s  for  which  the  above  holds  are  dense  in  F2(t0,F).  Hence,  we  have 
(2.29)  holding  for  all  77  G  F2(t0,F).  Thus,  we  have  v  satisfies  the  first  equation  of  (2.13). 
To  obtain  v(t0)  =  v0,  we  may  use  the  same  arguments  with  arbitrary  77  G  C1(f0,T)  with 
77(F)  =  0  but  77 (t0)  7^  0.  Therefore,  by  the  uniqueness  of  the  solution  for  (2.13)  we  have 
v  =  v.  □ 


Remark  2.7.  Since  the  example  given  in  [17]  shows  that  the  identity  mapping  from  M{S)  x 
t^(fl)  —>  £%{£  x  ft)  is  not  jointly  continuous,  the  cost  functional  J  defined  by  (2.9)  is  not 
jointly  continuous  over  the  space  M[S)  x  t^(ft). 
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Theorem  2.8.  (See  [44,  Corollary  3.2])  Let  X  be  a  nonempty  compact  and  convex  subset  of 
a  Hausdorff  topological  vector  space,  and  let  Y  be  a  nonempty  convex  subset  of  a  Hausdorff 
topological  space,  respectively.  Suppose  that  /  :  X  x  Y  K  satisfies  (i)  for  each  fixed 
x  G  X,  y  i — »  ^(x,y)  is  lower  semicontinuous  and  quasiconvex;  (ii)  for  each  fixed  y  G  Y, 
x  i — »  Jt(x,y)  is  upper  semicontinuouos  and  quasiconcave.  Then  we  have 

max  min  f  (x,  y)  =  min  max  f(x,y). 

Moreover,  if  Y  is  compact,  then  J?  has  a  saddle  point  in  X  x  Y. 


Note  that  J  of  (2.9)  is  continuous  and  linear  in  each  variable.  Thus,  by  Theorems  2.1  and 
2.8  we  find  that  J  has  a  saddle  point,  which  is  summarized  in  the  following  theorem. 

Theorem  2.9.  There  exists  a  pair  of  relaxed  controls  <f>*  G  1)  and  G  &(£)  such  that 

for  any  <he  G  &(£)  and  G  S^(Q). 


From  Remark  2.7,  we  have  that  the  cost  functional  J  is  not  jointly  continuous,  which 
implies  that  there  are  challenges  in  carrying  out  standard  numerical  approximations  (such 
as  the  delta  approximation  or  spline  approximation  employed  in  [2]  for  the  static  case  and 
the  discretization  method  used  in  [11]  for  computation  of  relaxed  optimal  control)  in  the 
domain  &(£)  x  M{£1).  To  circumvent  these  difficulties,  we  will  consider  a  special  case  of  our 
relaxed  differential  game  in  the  remainder  of  this  paper,  where  we  assume  that  both  evader 
and  interrogator  have  only  finite  number  of  choices  at  each  time  t.  Then  we  develop  a 
computational  framework  to  obtain  approximate  optimal  strategies  for  the  resulting  relaxed 
differential  game. 


3  Construction  of  Approximate  Saddle  Point  Strate¬ 
gies  for  a  Simplified  Relaxed  Differential  Game 

In  this  section,  we  assume  hi  =  (a;*,  uj*2,  . . . ,  c v(n}  C  M+,  and  £  =  {£*,£2, . . .  ,£z*}  C  M+,  and 
restrict  our  controls  to  measures  of  the  form 

m  l 

®u{t)  =  ®e(t)  =  J^u£!i(t)  Aej,  (3.1) 

j= 1  i= 1 

where  Au*  is  the  Dirac  delta  measure  with  atom  at  cu*. 

3  J 
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We  consider  the  resulting  relaxed  differential  game  (which  will  be  termed  as  simplified  relaxed 
differential  game  in  the  remainder  of  this  paper)  with  strategies  and  payoff  in  the  sense  of 
Berkovitz  [7].  Specifically,  in  Section  3.1  we  show  that  this  game  has  a  value  and  the  value 
function  is  the  unique  viscosity  solution  of  Hamilton-Jacobi-Isaacs  equation.  In  Section  3.2 
we  employ  Galerkin  approximation  techniques  to  reduce  the  problem  into  one  in  a  finite  di¬ 
mensional  space,  and  show  that  these  approximate  differential  games  have  a  value.  Moreover 
the  corresponding  value  functions  converge  pointwise  to  the  value  function  of  this  simplified 
relaxed  differential  game.  Then  in  Section  3.3  we  use  Berkovitz’s  method  to  construct  opti¬ 
mal  strategies  for  the  approximate  differential  games,  and  show  that  these  optimal  strategies 
are  the  approximate  optimal  strategies  for  this  simplified  relaxed  differential  game. 

First  we  introduce  some  necessary  notation.  Let  u£ji  :  [t0,T]  — >  M  and  vw.j  :  [ t0,T ]  — > 
M  be  measurable  nonnegative  functions,  i  =  1,2 ,...,/  and  j  =  1,2,...,  m,  which  satisfy 

l  m 

=  1  and  Uu,j(t)  —  1-  Let 

i= 1  3=1 


u£  =  (ueA,  u£,2,  •  •  • ,  uefiT,  and  u u  =  («W) i,  uw,2,  •  •  • ,  uufiT. 

Then  we  have 

u£  e  U£[t0,T],  uu  el4w[to,T], 

where  for  to  <  s±  <  S2  <  T 

44[s'i ,  s2]  =  {u  |  u  :  [si,s2]  - 1  Ul  is  measurable}, 

44j[si,s2]  =  {u  |  u  :  [si,s2]  y  U'n  is  measurable}. 

Here  for  k  eN  we  define 

Uk  =  j/i=  (/ii,/i2,...,/ifc)T  e  Rk  :  =  1,/L  >  oj  . 

Hence,  both  Ul  and  Um  are  convex  and  compact.  For  notational  simplicity,  we  shall  write 
U£  for  74  [to,  T }  and  44,  for  44,  [to,  T\. 

The  corresponding  relaxed  control  form  of  the  state  equation  (2.5)  is  then  given  by 

d  d  1  9  d2 

— u(t,x)  =  b(x)  —v(t,x)  +  -a2(x)  —v(t,x)  -  A v(t,x)  +  f(x,u£(t),uu(t)), 
v(t,x)  =  0,  v(t,x)  =  0,  (3-2) 

v(to,x)  =  v0(x). 


Here  the  function  /  :  [x,  x\  x  Ul  x  Um  — >  M  is  defined  by 

m  l 

f  (*F  AL,  k'iS)  A  'y  y  ^  /iojj l^s,%x{p^ ,  G ,  Wj )  A/i£ 
j= 1  i=  1 
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where  B(x)  is  a  /  x  m  matrix  with  its  (i,j)th  element  being  x(x,  £*,  ui*).  Observe  that 
is  continuous  in  [x,  x\.  Hence,  /(•,  fi£,  nJ)  E  C([x,  x])  C  HI  and  /  is  bounded  in 
the  domain  [x,  x]  x  Ul  x  Um.  In  the  following,  we  denote  /(•,  fi£,  nu)  by  /j,u)  for  ease 

in  presentation.  It  is  easily  seen  that  J-  \Ul  x  Um  — >  HI  is  continuous. 


3.1  Value  of  Differential  Game 


In  this  section,  we  will  study  our  differential  game  with  strategies  and  payoff  in  the  sense  of 
Berkovitz.  Specifically  we  will  use  some  of  the  results  in  [21,  35]  to  show  that  our  differential 
game  has  a  value  and  this  value  function  is  the  unique  viscosity  solution  of  Hamilton- Jacob in- 
Isaacs  equation. 


Define  the  linear  operator  A  :  V  — »  V*  by  a(0, 0)  =  (— A(j),  V')v*,v,  where  a  is  defined  in 
(2.12).  Then  (3.2)  can  be  rewritten  in  the  following  abstract  (in  V*)  form 

v{t)  =  Av(t)  +  .F(ue(t),  uw(t)),  v(t0)  =  v0.  (3.3) 


and  its  corresponding  weak  form  is  given  by 

(v(t),if))y*ty  +  a(v(t),i/;)  =  (Jr(ue(t),uu(t)),i/;),  for  any  i)  £  V 
v(t0)  =  v0. 


(3.4) 


By  (2.18),  (2.19)  and  the  arguments  in  [4,  Section  2],  we  know  that  A  generates  an  analytic 
semigroup  S(t)  on  HI,  V  and  V*.  Note  that  V  is  compactly  embedded  in  HI.  Hence,  S(t)  on 
HI  is  compact  (see  [14,  page  394]).  In  addition,  by  Theorem  2.1  in  [4],  we  know  that  for  any 
given  v0  E  HI  there  exists  a  unique  solution  v  for  (3.4)  with  v  e  ff1(t0,  T;  ¥*)  D  L2(t0,T;¥) 
given  by 

v(t)  —  S(t)vQ+  /  S(t  -  s)Jr(u£(s),uu(s))ds,  (3.5) 

Jt0 

and  there  exists  some  positive  constant  k  such  that 

\\v(t)\\u  <  K  ^0111  +  ^  ||^(«e(s),«w(s))||^.d3^  ,  te[t0,T].  (3.6) 

Furthermore,  we  have  v  E  C(t0,T;  HI). 


Let  Q  :  HI  — >■  M  dehned  by  G(<f>)  =  /  4>(x)dx  for  any  0  e  HI.  Note  that  HI  is  continuously 

J  X_ 

embedded  in  Ll(x,  x).  Hence,  for  any  0,^  E  HI  there  exists  a  positive  constant  k9  such  that 


m)-Qm  = 


(0(x)  —  0(x))dx 


<  ^110  -^lll 
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Now  we  adjoin  the  differential  equation 

v°(t)to,vQ,ue,uu)  =  Q(v(t,x-,t0,v0,ue,uu)),  v°(t0)  =  0  (3.7) 

to  (3.4).  Note  that  /  is  bounded  in  the  domain  [x,  x]  x  Ul  x  Um  and  n^)  G  H.  Hence, 
by  (3.6)  we  know  that  there  exists  some  positive  constant  k°  (independent  of  the  choices  of 
u£  and  uu)  such  that 

|u°(t;to,uo,«e,Uw)|2  <  «°(||vo||h  +  (T  -  to))  (3.8) 

for  any  t  £  [to,T]. 

We  now  define  the  strategy  for  evader  and  interrogator  in  the  sense  of  Berkovitz  (e.g.  see 
[21,  34]).  A  strategy  IN  for  the  evader  is  a  choice  of  a  sequence  ne  =  {ftf}  of  partitions  of 
[to,  T]  and  a  choice  of  a  sequence  of  maps  r]1®  =  {r^e,n},  where  r{Is,n  is  described  below.  For 
notational  simplicity,  we  will  suppress  the  dependence  on  ne  in  the  notation  and  write  T£  for 
and  F”  for  r“e>n.  Let  the  partition  points  of  it?  be  t0  =  <  t?’1  <  . . .  <  t?e'n*  =  T 

with  ||7r"||  =  max  — >■  0  as  n  — >  oo.  Each  map  T”  is  a  collection  of  maps 

l<i<n 

{r where  r”’1  £  U£[t o,^’1)  and  for  2  <j<  n£, 

:  U£[tQXe,j~l)  X  U^to,^-1)  ->■  U£[tn^j-lXe,j)- 

A  strategy  for  the  interrogator  is  similarly  defined  (by  replacing  the  subscript  e  to  oj,  and 
subscript  w  to  e). 


Note  that  a  pair  (r™,  T”)  of  nth  stage  strategies  determine  uniquely  a  pair  (n”,  u?)  £UexUi0 
as  follows.  Let  7rn  =  {to  =  tn’°  <  t”’1  <  . . .  <  tn,n  =  T}  be  the  common  rehnement  of  7r” 

and  7 r”.  The  control  functions  u?  =  (u”’1,  u”’2,,, . . ,  u”’n)  and  n”  =  (n”’1,  u™’2, _ ,  u”’n), 

where  iqn’J  G  We[iftJ_1,ift)i)  and  G  ^[A^1, T’J),  j  =  1,2 ,...,h.  Let  and  uJO  be 
the  restriction  of  n”  and  n”  to  [t0,tnj),  respectively.  On  [tn’0,tn,:L),  we  set  w™’1  =  T”’1  and 
w])’1  =  r”’1.  Let  1  <  j  <  n  —  1.  If  there  exists  1  <  i  <  n£  such  that  tn,j  =  tfe’1,  then  on 
[th’j ,th’j+1)  we  set  u?’j+1  =  Tn/+1{uneij,ulj)  and  unJ+l  =  n^-,),  where  /  is  the 

greatest  integer  such  that  t'ff”1  <=  tn,j  and  j'  is  the  integer  such  that  tj^'1  =  tn,j' .  If  there 
exists  1  <k<nu  such  that  t*J  =  t^’k,  then  on  [th’j  ,th’j+1)  we  set  u^j+1  =  F”’l+1(i 
and  u?,j+1  =  r”’fe+1(n”j,  n[0),  where  l  is  the  greatest  integer  such  that  tfs’1 
is  the  integer  such  that  t?e’1  =  in'J  .  The  pair  (u”,u™)  determined  this  way  is  called  the  nth 
stage  outcome  of  the  pair  (T",  T”)  of  nth  stage  strategies. 


Now  let  {u0,n}  be  a  sequence  converging  to  v0.  For  each  n,  we  have  the  nth  stage  trajectory 
v{-  \t0 ,u0ln,<,O  =  (v°(-  ;t0,v0,n,u^,u^))T ,  which  is  the  unique  solu¬ 

tion  to  (3.7)  and  (3.4)  corresponding  to  the  control  functions  uf  and  u?  and  initial  condition 
(0,uo,r).)T-  Note  that  S(t)  on  HI  is  compact.  Hence,  by  [21,  Lemma  2.2]  we  have  the  following 
result. 

Lemma  3.1.  Let  (n™,  n”)  be  the  nth  stage  outcome  of  the  pair  (r™,  T”)  of  strategies  on 
[to,T],  and  {u0,n}  be  a  sequence  converging  to  vq.  Then  the  sequence  {i>(-  ;  toWo,n,  u£,  UZ)} 
is  relatively  compact  in  C(to,  T;  H). 
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With  Lemma  3.1  and  inequality  (3.8),  we  now  can  define  the  concept  of  motion  in  the  game. 
Any  uniform  limit  of  a  subsequence  of  the  nth  stage  trajectories  {vf  ;  £oWo,nW"w")}^i> 
where  u0n  — >  v0,  and  is  the  outcome  of  the  pair  (T™,T™),  is  called  the  motion  of 

the  game  corresponding  to  strategies  T£  =  {T™}  and  T^  =  {T™}  that  starts  from  initial  point 
(t0,v0).  This  motion  is  denoted  by  u[-  ;  t0,  v0,  T£,  Tw]  =  (u°[-;  t0,v0,  Te,  Tw],  v[-  *  t0,v0,  Te,  Ta,])T. 
The  set  of  all  motions  u[  •  ;  foWo,  re,  rw]  corresponding  to  the  pair  (T£,TW)  is  denoted  by 
V[  •  ;  to,  vq,  T£,  Tu\.  Similarly,  we  use  V°[  • ;  to,  vq,  T£,  rw]  and  V[  • ;  to  Wo,  T£,  rw]  denote  the  set 
of  v°[  •  ;  to,  Vo,  Te,  Tw]  and  v[  ■  ;  t0,  v0,  Te,  Tw],  respectively. 

If  the  initial  point  of  the  augmented  component  of  the  trajectory  is  given  by  a  nonnegative 
number  Vq  ,  then  the  corresponding  extended  trajectory  with  controls  u£  and  u w  is  denoted 
by 

v(  ■  ;  t0,  v0,u£,  uu)  =  [uq  +  v°(  ■  ;  t0,  v0,  ue,  uu),  v(  ■  ;  t0,  v0,  u£,  ujf , 
where  v0  =  [u£,  v0)T  ■ 


To  complete  the  description  of  the  game,  we  need  to  define  the  payoff  structure.  The  payoff 
corresponding  to  a  pair  of  strategies  (Te,  rw)  is  set  valued  and  is  defined  by 

^(TejTw^ojUo)  =  V°[T;i0,uo,re,ra,]. 

The  evader  tries  to  choose  T£  so  as  to  minimize  all  elements  of  J(T£,  rw;  t0,vo)  and  the 
interrogator  tries  to  choose  Tw  so  as  to  maximize  J(Te,  T^;  to,  Vo).  Hence,  we  see  that  the 
payoff  is  not  required  to  be  evaluated  along  a  trajectory  of  the  system.  We  define 

o)  =  sup  inf  J{T£,Yu-,  t0 ,v0),  J+(t0,v0)  =  inf  sup  J{Ye,  T^;  t0,v0). 

ry  rw 


(If  {Qf 3}  is  a  collection  of  subsets  of  R,  then  sup  Q 3 

P 


=  sup  UyQp  and  inf  Qs 


=  inf  UpQp.) 


Then  it  is  easy  to  see  that  J  (to,vo)  <  J+[to,vo).  If  J  (to Wo)  =  i7+(foWo),  we  denote 
this  common  value  by  l7*(t0,u0)  and  say  that  the  game  has  a  value  equal  to  J*[t0,v 0). 


Let  Q  and  Q  be  two  sets  of  real  numbers.  We  say  that  Q  >  Q  if  for  every  q  e  Q  and  every 
q  G  Q  the  inequality  q  >  q  holds.  Also,  if  a  is  a  real  number  and  Q  is  a  set,  by  a  >  Q  we 
mean  that  a  >  q  for  all  q  G  Q.  A  similar  meaning  holds  for  a  <  Q. 


A  pair  of  strategies  (T*,  T* )  is  said  to  be  the  saddle  point  (or  optimal  strategies )  for  the  game 
with  the  initial  point  (£0Wo)  if  the  inequality 


i7(r*,  r^;  fowo)  <  ^(r^r^towo)  <  df(r£,r*;t0,w) 


holds  for  all  (T£,rw).  Note  that,  if  (T*,T*)  is  a  saddle  point,  then  J(T*,  T* ;  t0,  u0)  is  a 
singleton  and  is  given  by 

i7(r*;  r*;  fowo)  =  i7*(A>wo)- 
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We  now  consider  games,  trajectories  and  motions  with  varying  initial  points  (r,  p )  G  [to,  T]  x 
EL  Given  0  G  El  and  tp  G  El,  we  define 

=  (0, 7(ne,nu))  +  /A  e  e 

Observe  that  for  any  0,  </?  G  El,  Ft  is  continuous  and  linear  in  each  variable.  Since  Ul  and 
Um  are  both  compact  and  convex,  we  have  'H~(p,  0)  =  'H+(</3,  0)  for  any  0,  99  G  El,  where 

1-L~((p,(j))=  max  min  'H(pe,Pu,  </?,  0),  I-L+(<p,(f))  =  min  max  %(/xe,  /4,;  </?,  0). 

He&U1 

With  this  equality  we  say  that  Isaacs  condition  is  satisfied.  Let  K*  =  "H-  =  H+.  Then  by 
[21,  Theorem  3.2]  we  have  that  the  differential  game  has  a  value  J*  =  LW  =  77+  and  it  is 
the  unique  viscosity  solution  of  the  Hamilton-Jacobi-Isaacs  (HJI)  equation 

J:  +  (A<P,  DT)  +  DT)  =  0,  (r,  ip)  G  (to,  T)xi 

J*(T,<p)  =  0  1  j 

in  the  sense  of  Crandall-Lions  [13].  Here  DJ*  denotes  the  Frechet  differential  of  J*  with 
respect  to  p  G  El. 

In  [21,  35],  Berkovitz’s  approach  of  constructing  optimal  strategies  [7]  was  shown  to  be  ap¬ 
plicable  to  the  infinite-dimensional  differential  game  as  well.  It  should  be  noted  that  the 
Berkovitz  method  involves  using  some  feedback  maps  to  construct  the  saddle  point  for  the 
game,  where  these  feedback  maps  are  obtained  by  using  some  appropriate  level  sets  related 
to  the  value  function.  Hence,  to  use  this  method  one  needs  to  compute  the  value  function 
J*  for  (3.9),  which  is  a  partial  differential  equation  in  an  infinite-dimensional  space  with 
the  unbounded  operator  A  involved.  To  overcome  some  of  these  difficulties,  the  authors 
in  [34]  first  approximate  the  unbounded  operator  by  a  bounded  operator  (e.g.,  the  Yosida 
approximation),  and  then  study  the  associated  approximate  differential  games  with  these 
bounded  operators  and  obtain  their  value  functions.  In  addition,  the  authors  showed  that 
the  value  function  for  the  original  differential  game  is  the  limit  of  the  value  functions  of  these 
approximate  differential  games,  and  the  saddle  point  for  the  approximate  differential  game 
is  an  approximate  saddle  point  for  the  original  differential  game.  Although  this  method 
makes  the  problem  conceptually  easier,  one  still  needs  to  solve  a  partial  differential  equation 
defined  in  an  infinite-dimensional  space.  In  the  remainder  of  this  paper,  we  address  this  issue 
by  employing  Galerkin  approximation  techniques  to  reduce  the  problem  to  one  in  a  finite 
dimensional  space.  This  approximation  technique  has  been  used  in  [4]  to  establish  a  compu¬ 
tationally  feasible  approximation  theory  for  linear  quadratic  regulator  control  problems  for 
infinite  dimensional  systems  with  unbounded  input  operators. 


3.2  Value  of  Approximate  Differential  Game 

In  this  section,  we  first  employ  Galerkin  approximation  methods  to  obtain  a  finite-dimensional 
approximation  of  the  infinite  dimensional  system  (3.4).  Then  we  show  that  the  associated 
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approximate  differential  games  indeed  have  values,  and  these  value  functions  converge  point- 
wise  to  the  original  value  function  J* . 

Let  VN  be  a  sequence  of  finite-dimensional  subspaces  of  V  C  H.  We  assume  the  following 
standard  [4,  Sec.  4]  approximation  condition 


(HI)  For  any  0  G  V,  there  exists  a  sequence  0W  in  VN  such  that  \\(j>N  —  0||v  — >  0  as  N  — >  oo. 


Dehne  the  operator  AN  :  VN  — >  VN  (which  approximates  A)  by  restriction  of  a  to  VN  x  VN; 
this  yields 

(—.4^0,0)  =  a(0,  0),  for  all  0, 0  G  VN . 

Let  the  operator  VN  denote  the  usual  orthogonal  projection  of  El  onto  VN .  That  is,  for 
0  G  HI,  we  have  VN  0  G  VN  is  defined  by 


(VN4>,ip)  =  (0,0),  for  all  0  G  VN . 


(3.10) 


It  then  follows  from  (HI)  that  || VN<f>  —  0||h  — »  0  as  N  — y  oo  for  any  0  G  HI.  In  addition,  this 
projection  operator  can  readily  be  extended  to  VN  :  V*  — >  VN  by  replacing  (0,0)  in  (3.10) 
by  (0, 0)v*,v  for  all  0  G  V*.  For  this  family  of  approximations,  the  approximate  problem 
corresponding  to  (3.3)  is  given  by 

dV  Jf*  =  ANvN{t)  +  TN(ue(t),  uw(t)),  vN(t0)  =  Vq.  (3.11) 

Here  vN(t)  is  the  notation  for  vN(t,-),  JrN(ue(t),  uu{t))  =  VNJr(ue{t),  uu(t)),  and  Vq  = 
Vnvq.  Note  that  for  any  given  uE  and  «w,  T (ue(t) ,  uw(t))  G  C([x,x])  C  HI.  Hence,  for 
any  0  G  VN  we  have  (A*  (ue(t),  uw(t)),  0)  =  (J-(ue(t),  u^it)),  0).  Thus,  the  weak  form  of 
(3.11),  i.e. ,  the  approximate  problem  corresponding  to  (3.4),  can  be  formulated  as  finding 
vN{t)  G  VN  which  satisfies 


^  +  =  (^(w(t),  uu(t)),  0),  0  G  V", 

vN(to)  =  V?. 


(3.12) 


Let  Qn  :  V7v  — >•  M  defined  by  <0V(0)  =  /  ^{x)dx  for  any  0  G  VA.  Then  the  approximate 


tn 


->N 


N 


problem  corresponding  to  (3.7)  is  given  by 


dv0N(t ) 
dt 


QN(vN(t)),  v0N(t0)  =  0. 


(3.13) 


Let  {0^}^!  be  a  basis  of  YN ,  TA  =  (0^,0^, 
entry  being  (0^,00).  Then  there  exists  vN(t) 


. . .  ,0(^)T,  and  Qn  G  RNxN  with  its  (i,j)th 
=  zyf  (t),  •  •  • ,  u^(t))T  G  lw  such  that 


vN(t)  =  (/(f))V. 
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Substituting  the  above  equality  into  (3.12)  and  letting  i/j  =  for  j  =  1,  2, . . . ,  N,  we  can 
obtain  the  matrix  representation  AN  of  operator  AN  (that  is,  ANvN  =  (ANuN)T^IN)  and 
the  vector  representation  FN (ue(t) ,  uu(t))  of  FN (ue(t) ,  uu(t))  (that  is,  FN (uE(t) ,  uu(t))  = 
(FN(u£(t),ulv(t)))'1  TjV)  with  respect  to  the  basis  These  are  given  by 

AN  =  (QN)~l^N,  where  G  RNxN  with  its  (i,j)  element  being 


and 


F^u^t)^^))  =  (QN)-^(F(u£(t),u^t)),^),...,(F(u£(t),u^t)),^))\ 


respectively.  Let  Uq  be  the  vector  representation  of  %  with  respect  to  the  basis  {'0»}*=1- 
Then  we  obtain 

<  =  o qny\m r>,  M2),  •  •  • , 

We  can  reformulate  (3.11)  in  terms  of  the  system  of  ordinary  differential  equations 
duN(t) 


dt 


=  ANvN{t)  +  FN{ue{t),uu{t)),  vN{t0)  =  v<\ 


(3.14) 


which  has  a  unique  solution.  Hence,  (3.11)  also  has  a  unique  solution,  which  can  be  written 


as 


v 


N 


(t)  —  etANVNv0  +  /  e(t  S)'4JV FN(u£(s),uu(s))ds. 


't0 


N  /  fj 

Let  Gn  :Rn  ^  M  defined  by  GN  {rj)  =  ^  I  / 

RN .  Then  (3.13)  can  be  rewritten  as 

.ON  I 


(3.15) 


W  ( x)dx  rjj  for  any  77  =  (771,  rj2,  ...,Vn)  G 


(t)  _  r,N t.,N 


=  GN(vN(t)),  v0N  (t0)  =  0. 


dt 


(3.16) 


ON  _  ..OJV 


That  is,  we  set  z/UiV  =  v 


The  motion  of  corresponding  TVtli  approximate  differential  game  (in  VN)  associated  with  a 
pair  of  strategies  (re,rw)  is  denoted  by 


[  • ;  t0,  tf ,  re,  rw]  =  (v0N[  ■  ■  t0,  <,  re,  rw],  • ;  t0,  <,  re,  rjf 

and  the  set  of  all  vN[  ■  ;t0,Vo  ,T£,TU]  is  represented  by  1/V[  •  ;  to,  Vq,  T£,  rw].  Similarly, 
ViV[  •  ■,t0,Vo,Te,ruj]  and  V0jV[  •  [t0,  v* ,  Te,  Tw]  denote  the  set  of  vN[  •  ;  i0,  Te,  Tw]  and 
v0N[  ■ ;  to,  Vq  ,  r£,  rw],  respectively.  The  approximate  payoff  corresponding  to  a  pair  of  strate¬ 
gies  (T£,  Tw)  is  given  by 


JN(T£,  Tw;  t0,  <)  =  V07v  [T;  t0,  <,  W,  Tw]. 


.N 


ON  r 


,N 


(3.17) 


Instead  of  directly  consider  the  Nth  approximate  differential  game,  we  may  also  consider  its 
corresponding  matrix  representation  (i.e.,  the  differential  game  in  RN).  The  motion  for  the 
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corresponding  matrix  representation  of  the  Nth  approximate  game  associated  with  a  pair  of 
strategies  (T£,  is  denoted  by 


p" [ • ; to, vo , r£, rw]  =  (u0N[ • • t0, , r£, rw], vN[-- t0, , r£, rw]) 


and  the  set  of  all  uN[  ■  ;  to,  Vq  ,  re,  Tw]  is  represented  by  VN[  ■  ;  to,  Vq,  re,  Tw].  Similarly, 
VN[  ■  ;t0,Vo,Te,ru]  and  O0Ar[  •  ;  t0,uj^,  Te,  Tw]  denote  the  set  of  vN[  •  ;  t0,  v* ,  Te,  Tw]  and 
v0N[  ■ ;  t0, vq  ,  r£,  rj,  respectively.  Then  we  have 


.Nr 


• ;  t0,  v»,  re,  rw]  =  K[-; 10,  < ,  re,  r  j)rT 


.JVr 


,JV 


\TlT,iv 


Thus,  we  obtain 


• ;  to,  V”,  re,  rw]  =  v0N[  • ;  t0,  r£,  rw]. 

Therefore,  the  payoff  (3.17)  can  be  equivalently  written  as 

Jn(T£,  T^;  t0,  i£)  =  V0N [T ;  t0,  ,  Te,  Tw]. 

That  is,  for  any  given  Vq  =  (u^  )TTiV  we  have 

77JV(r£,raj;t0,n^r)  =  JN(re,ru-t0,v”) 

holds  for  any  pair  of  strategies  (Te,  Tw). 


(3.18) 


(3.19) 


We  now  consider  the  approximate  differential  game  (in  WLN)  with  varying  initial  points  (r,  £)  G 
[t0,T]  x  R^.  we  define 


JN~( 


r,  £)  —  sup  inf  J 

ry  r 


N 


(r£,rw;r,e),  j 


N+ 


(t,  £)  =  inf  sup  J 
r, 


N 


(Te,rw;r,0- 


Then  JN  and  JN+  are  uniformly  Lipschitz  continuous  on  bounded  subsets  of  [to,T)  x  ~RN 
(see  [7,  Theorem  7.2]).  For  any  given  £,  rj  G  WLN ,  we  define 

HN{ne,nu-,£,rj)  =  r]TFN{^iiu)  +  G"(£),  He  e  ^,/G  e  £/”*• 

Observe  that  for  any  given  rj,  £  G  R^,  77^  is  continuous  and  linear  in  each  variable.  Since  Ul 
and  Um  are  both  compact  and  convex,  we  have  HN~(£,rj)  =  HN+(^,rj)  for  any  £,  r/  G  R^, 
where 


HN~{Z,ri) 


max  min  HN(ne,nu\£,rj), 


HN+(tv) 


min  max  HN(a£,au;^,r]). 
neeul  n*eum 


Let  Hn*  =  Hn  =  Hn+ .  Then  we  know  that  there  exists  a  value  JN*  =  JN  =  JN+  and  it 
is  the  unique  viscosity  solution  of  the  HJI  equation  given  by  (see  [8]) 


JN*  +  (An^tvjn*  +  hn *  ^  V  jN*j  =  o5  (Tj  f )  e  (to,  T )  X  Rn, 

Jn*(T,£)  =  0. 


(3.20) 
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Thus,  the  TVth  approximate  differential  game  (in  VN)  with  initial  point  (r,  ip)  G  [to?  T]  x  VN 
also  has  a  value  and  it  is  given  by  JN*{r)  ip)  =  JN*(t,£),  where  £  is  the  vector  representation 
of  ip  with  respect  to  the  basis  {ip^}^=1  (i.e.,  ip  =  £7  '&N). 

In  the  conclusion  of  this  section,  we  show  that  the  value  function  for  the  iVth  approximate 
differential  game  converges  pointwise  to  the  value  function  of  our  simplified  relaxed  differen¬ 
tial  game.  We  first  recall  the  following  convergence  result,  which  is  standard  in  the  literature 
(e.g.,  see  [4,  Lemma  4.1]  or  [26,  Chapter  III]). 

Lemma  3.2.  Letue  and uw  be  given.  Suppose  (HI)  is  satisfied  and v0  G  EL  IfvN(t;t0,VNv0,ue,uw)  G 
VN,  t  >  t0,  satisfies  (3.12),  then  we  have 

|| vN(t;t0,VNv0,u£,u<JJ)  -  v(t]to,v0,ue,uu) ||H  -»  0  as  N  ->■  oo 


and 


vN(s]  t0,PJVu0,M£,uw)  -  v(s-,t0,v0}uE,uIJJ)\\Yds  — >■  0  as  N  oo 


~>N 


'to 


uniformly  in  t  G  [to,T]. 


In  addition,  by  using  Trotter-Kato  theorem  we  have  the  following  important  convergence 
result  (see  [4,  Lemma  4.3]). 

Lemma  3.3.  For  all  <p  G  M,  we  have 

|| etAN VN <p  —  S (t)0||]H  ->0  as  N  — >  oo,  (3-21) 

where  the  convergence  is  uniform  on  bounded  t-interval. 


Using  Lemma  3.3  and  the  same  arguments  as  those  in  [34,  Lemma  3.2]  we  can  obtain  a 
stronger  result  than  that  presented  in  Lemma  3.3. 

Lemma  3.4.  For  any  relatively  compact  set  icCi, 

snp  sup  \\etAN VN <p  —  >S'(t)0||ii  — »  0  as  N  — >■  oo.  (3.22) 

</>eHc  t0<t<T 

Lemma  3.5.  Suppose  (HI)  is  satisfied.  Then  for  every  v0  G  El,  we  have 

sup  sup  \\vN(t-t0,VNv0,ue,uOJ)  -  u(t;io,u0,ue,uw)||H  0  (3-23) 

(%,«u)6WexW„  t0<t<T 

and 

sup  sup  \v0N(t-t0,VNv0,ue,uu)  -  v°(t;t0,v0,u£,uljj)\  0.  (3.24) 

(Ws,Wu;)eWeXW„  t0<t<T 

as  N  — y  oo. 
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Proof.  Let  F{Ul  x  Um)  =  |  (fie,fiu)  E  Ulx  Um}.  Note  that  T  :  Ul  x  Um  — )■  H  is 

continuous  and  Ul  x  Um  is  compact.  Hence,  x  Um)  is  compact.  By  (3.5)  and  (3.15) 
we  have 


|| vN(t;t0,VNv0,ue,uIJJ)  -  v(t]  to?  Vq,  ue,  uw)  ||jj 
<  ||eMiV'PAruo  -  5'(t)u0||H 

+ 

Thus,  by  the  above  inequalities,  the  compactness  of  T(fJl  x  Hm)  and  Lemma  3.4  we  obtain 
the  desired  result  (3.23). 

Note  that  HI  is  continuously  embedded  in  L1(x,  x).  Hence,  there  exists  a  positive  constant 
a  such  that 


e(t  s)^p^(M£(s);Mw(s))  -  ^(f  -  s)Jr(u£(s),uUJ(s)) 


'to  L 


ds 


H 


| vON(t]to,VNv0,u£,uUJ)  -  n0(f;to,n0,u£,uw)| 

rx 

(vN(t,  x ;  to,  VNv0 ,ue,  uj)  -  u(t,  x;  t0,  u0,  u£,  ujj)  dx 


<  a\\vN(t-,t0,VNv0,u£,uw)  -  v(t;t0,v 0,  u£,  uw)||h. 

Thus,  by  (3.23)  and  the  above  inequality  we  have  the  desired  result  (3.24). 
Theorem  3.6.  For  every  t  e  [fo,T]  and  ip  e  HI,  we  have  JN*{r,VN  <p)  — >•  J*{r,p). 


□ 


Proof.  Fixed  (r,  </?)  G  [t0,  T]  x  H.  For  any  given  8  >  0,  there  exist  a  strategy  and  a 
strategy  such  that 

supl7(r°,ra;;r,^)  <  J*(r,(p)  +  ^  (3.25) 

r  4 

-L  UJ 


and 


infJ(r£,r°;r,^)  >  J*(r,<p)  -  5-. 


(3.26) 


Then  for  any  given  positive  integer  k ,  there  exists  a  strategy  T, ,  and  an  associated  motion 
vk[-]T,Vkip,  r°,r£]  such  that 


d 


Jk*(r,Vkip)  <  sup  Jk{T°e,Yu\T,  Vkif)  <  v"k[T-,T,Vkif,  F^Tk]  +  -,  (3.27) 

r  4 

-L  CJ 

and  there  exists  a  strategy  Yk  and  an  associated  motion  vk[  ■  ;  r,  Vk>p,  Tk,  T°]  such  that 

Jk*(r,Vkif)  >  inf  Jk{Ye,  ;  r,  Vkip)  >  vok[T;T,Vk<p,rk,r°J  -  (3.28) 
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By  Lemma  3.5  we  know  that  there  exists  a  positive  integer  Nq  such  that  for  any  N  >  No  we 
have 

£ 

sup  sup  \v0N(t-,T,VN<p,Ue,Uu)-V0(P,T,ip,Ue,Uu)\<-. 

(«£,«„)ewexw„  r<t<T  D 


For  any  given  N  >  N0,  let  vN[  ■  ;  r,  VN(p,  T^]  be  the  uniform  limit  of  subsequence 
{vN{  •  ]T,VNv,u°£’nN,u^nN)}  of  sequence  {vN\  ■  ■,T,VNip,u°£’n,ukjn)}.  By  Lemma  3.1  and 

inequality  (3.8),  we  know  that  there  exists  a  subsequence  j  (^u^’nNN ,  Uu’nNN  j  j  of  sequence 

{(■u°’njv,  u^,nN)}  such  that  ^  •  ;  r,  ip,  -u°’njVjv,  u^’nNN  j  |  converges  uniformly,  and  we  denote 

this  uniform  limit  by  h[  •  ;  r,  <p,  T°,  T^].  Then  for  any  given  N  >  N0  we  can  always  find 
sufficiently  large  Unn  such  that 


v0N[T ;  r,  VNip,  r°,  r"]  -  v° [T ;  r,  ip,  r°,  T%] 


>7V,_  pO  pAT' 


0  pTVi 


< 


v0N[T ;  r,  Vl\,  r",  r"]  -  uUiV  (T;  r,  V!\,  u. 


>iV  pO  pAT 


OAT/ 


jV  O.njVjY  N,nN]s 


5  W'o;  , 


< 


+ 

+ 


uoiV(T;  r,  P  V  )  -  v°(T;  r,  ip,  u’nN" , 


n°(T; 


£  5 

0  ,nNjv  N  ,nN 


(3.29) 


N  N 

'£  5 


Similarly,  for  N  sufficiently  large  we  can  also  find  a  motion  v[  • ;  r,  9?,  T^,  r°]  such  that 

5 


v0N[T ;  r,  7^,  rf,  T°]  -  u°[T;  r,  p,  Tf ,  T°]  < 


(3.30) 


By  (3.25),  (3.27)  and  (3.29)  we  find  that  for  N  sufficiently  large  we  have 
JN*(r,  VN<p)-J*(r,  ip) 

<  von[T;t,pn(p,  r°,r^]  + 1  -  ^sup  j(r°,rw;T,<^)  - 1 


<  r,  vNp:  r°,  r"]  -  v°[t;  t,  r°,  r"]  + 

<  5. 


5 


(3.31) 


Similarly,  by  (3.26),  (3.28)  and  (3.30)  we  find  that  for  sufficiently  large  N  we  obtain 

J*(r,  p)  - JN*{t,  VN<p) 

<  iU  J(U,  r« ;  r.  V)  +  i  -  (vm  [T;  r.  VN<p,  rf ,  r»]  -  | 


/  n.0\niw  pAT  pO  ]  „,0N  [it,.  -pAf  pjV  pO  1  ,  ^ 

<  V  [2  ;  T,  ip,  I  e  ,  i  J  -  V  [1  ,T,r  If,  f  e  ,  1  J  +  - 

<  5. 


(3.32) 


Thus,  the  desired  convergence  result  follows  from  (3.31)  and  (3.32). 


□ 
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3.3  Saddle  Point  for  the  Approximate  Differential  Game 


In  the  previous  section,  we  have  shown  that  when  N  is  sufficiently  large  the  value  function 
JN*  for  the  TVtli  approximate  differential  game  is  an  approximation  of  value  function  J* 
for  our  simplified  relaxed  differential  game.  Hence,  instead  of  seeking  optimal  strategies 
for  our  simplified  relaxed  differential  game,  we  find  optimal  strategies  for  its  associated 
approximate  differential  game.  First  we  define  the  approximate  optimal  strategies  for  our 
simplified  relaxed  differential  game  (see  the  corresponding  definition  in  [34]). 

Definition  3.7.  A  strategy  r*  is  said  to  be  8-optimal  ( for  the  evader)  of  the  simplified  relaxed 
differential  game  with  initial  point  (f0,vo)  if 

sup  J (r*,  t0,  Vo)  <  J*(t0,v o)  +  S. 

rw 

A  strategy  r*  is  said  to  be  8-optimal  (for  the  interrogator)  of  the  simplified  relaxed  differential 
game  with  initial  point  ( t0,v0 )  if 


inf  J (G,  T* ;  to,  Vo)  >  J*(t0,v0)  -8. 

IfT*  and  r*  are  8-optimal,  then  we  say  that  (r*,  T*)  constitute  a  8-saddle  point  (or  8-optimal 
strategies)  for  the  simplified  relaxed  differential  game  with  initial  point  (t0,v o). 


The  below  result  shows  that  the  optimal  strategies  for  the  TVth  approximate  differential  game 
are  indeed  approximate  optimal  strategies  for  the  simplified  relaxed  differential  game  when 
N  is  sufficiently  large. 

Theorem  3.8.  Let  (T^*,  T^*)  be  a  saddle  point  for  the  Nth  approximate  differential  game 
with  initial  point  (t0,  VNv0).  Then  for  every  8  >  0  there  exists  N$  such  that  for  all  N  >  Ns, 
rf*  and  rf*  are  8-optimal  strategies  for  the  evader  and  interrogator,  respectively,  for  the 
simplified  relaxed  differential  game  with  initial  point  (t o,vo)- 


Proof.  Since  (Tf*,Tf*)  is  a  saddle  point  for  the  Nth  approximate  differential  game  with 
initial  point  (t0,VNVo),  we  have 

sup  JN( rf ,  rw;  to,  VNv0 )  <  VNv0)  <  inf  JN(T£,  T%* ;  t0,  VNv0).  (3.33) 

For  any  given  8  >  0,  there  exists  a  strategy  and  a  motion  u[  •  ;to,vo,rf*,  T°]  such  that 

5 


sup  J( rf ,  rw;  t0,  v0)  <  v°[T ;  t0,  u0,  Tf ,  T°J  +  ^ 
r  4 

1  OJ 


(3.34) 


and  there  exists  a  strategy  and  a  motion  u[  •  ;  t0,  v0,  such  that 


inf  J( re,  to,  Vo)  >  v°[T ;  t0,  v0, 


(3.35) 
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By  Theorem  3.6,  we  know  that  for  any  given  5  >  0  there  exists  a  positive  integer  N$i  such 
that  for  any  N  >  N$ i  we  have 


J*(t0,v0)  -  S-  <  JN*(t0,VNv 0)  <  J*(t0,v0)  +  S-. 


(3.36) 


In  addition,  by  using  the  same  arguments  as  those  in  Theorem  3.6  we  know  that  there  exists 
a  positive  integer  N52  such  that  for  any  given  N  >  Ns 2  we  can  always  find  a  corresponding 
motion  vN[-  ;to,  Vvq ,  T^*,  T°]  so  that 


vN0[T ;  t0,  Vvq,  T^*,  T°]  —  v°[T ;  t0,  u0,  T^*,  T°]  I  <  -A, 


and  a  corresponding  motion  vN[  ■  ;  to,  vq,  T^,  T^*]  so  that 


''O  T^N*~\ 


|uJVO[T;to^o,r°,T^]  -u°[T;to^o,r£°,rf]|  <  5~. 

Let  Ns  =  maxjA^!,  IV^}.  Then  by  (3.33),  (3.34),  (3.36)  and  (3.37)  we  find 

sup  t0,  v0)  <  J*(t0,v0)  +  5, 


(3.37) 


(3.38) 


(3.39) 


and  by  (3.33),  (3.35),  (3.36)  and  (3.38)  we  obtain 

inf  J(V£,  T^*;  t0,  v0)  >  J*{t0,v0)  -  5. 


(3.40) 


Thus,  for  all  N  >  Ns,  T(.v*  and  T(J*  are  4-optimal  strategies  for  the  evader  and  interrogator, 
respectively,  for  the  simplihed  relaxed  differential  game  with  initial  point  ( to,vo ).  □ 


By  Theorem  3.8  we  know  that  in  order  to  construct  approximate  optimal  strategies  for  our 
simplified  relaxed  differential  game,  it  is  sufficient  to  construct  a  saddle  point  for  the  TVth 
approximate  differential  game.  Let  £  =  (£°,  £)T,  fj  =  (77°,  77)^  G  Mw+1,  where  ^°,  770  G  M  and 
£,  7]  G  M.N .  Then  for  any  G  Ul  and  G  Um  we  define 

=  VTFN(ii£,^)  +  V°GN(0-  (3-41) 

Observe  that  for  any  given  £,77  G  MAr+1,  HN  is  continuous  and  linear  in  each  variable.  Since 
Ul  and  Um  are  both  compact  and  convex,  we  have 

max  min  HN (u£,  my  £,  77)  =  min  max  HN (u£,  uw;  £,  fj), 

fj,seul  "  ve&u1  nu,eu™ 

which  is  essential  to  construct  the  optimal  strategies. 

In  the  remainder  of  this  section  we  will  follow  the  Berkovitz  method  [7]  to  construct  the 
saddle  point  strategies  for  the  iVth  approximate  differential  game.  For  notational  simplicity, 
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we  suppress  the  dependence  of  the  strategies  on  N.  Let  (to,  v™)  be  the  initial  point  of  the 
game  and  aN  =  JN*(to,u^).  We  define  the  level  sets 

C°(aN)  =  {(t,0  G  [t0,T\  x  RN+1  I  £°  +  JN*(t,Z)  >  aN} 

C0 (aN)  =  {(t,0  e  [t0, T]  x  RN+1  |  £°  +  JN*(t,f)  <  aN}. 

Since  JN*  is  continuous  and  (t0,  (0,  z/^)T)  G  C°(aN)  D  C0(aAr),  the  level  sets  C0(o;Ar)  and 
C°(o;Ar)  are  nonempty  closed  sets. 


First  we  define  a  feedback  strategy.  Let  p£  =  {pf},  where  pf  :  [t0,T]  x  — >  Ul  is  called 

a  feedback  map  for  the  evader.  The  function  p£  determines  a  strategy  T£  =  T£(p£)  in  the 
game  with  initial  point  (t0,  //,f )  as  follows.  Let  nn  =  {t0  =  tn’°  <  tnA  <  ...  <  tn,n  =  T} 
be  the  partition  of  [to,T]  which  divides  [to,T]  into  n  subintervals  of  same  length.  Dehne 
T"’1  =  p£(t0,  (0,  Vq)t).  For  2  <  j  <  n  and  (uE,uJ)  G  74  [to,  x  Uu[to,  tnj_1),  we  dehne 


r  p(ue,uu)=p»(tn*-1,pN(tn*-1)), 


where  vN  (t)  =  (z/ow  (t;  t0,  uf ,  u£,  uu),  u N  (t\  t0,  uf ,  «e,  (i.e.,  (t-,to,v{f  ,u£,ua})  is  the 

solution  to  (3.16),  and  vN(t;  t0,  ,  u£,  uu)  is  the  solution  to  (3.14))  for  t0  <  t  < 

The  strategy  T£  =  T£(p£)  determined  this  way  is  called  a  feedback  strategy  for  the  evader 
corresponding  to  feedback  map  p£.  In  a  similar  fashion,  for  every  sequence  of  feedback  maps 
Puj  =  {p7f}  f°r  the  interrogator  with  p'f  :  [to,T]  x  MJV+1  — >  Urn  we  can  dehne  an  associated 
feedback  strategy  T^  =  for  the  interrogator. 


We  next  turn  to  construct,  for  a  given  partition  nn ,  optimal  feedback  maps,  which  are 
extremal  to  the  level  sets.  If  (£,£)  G  C0(a;Ar),  then  we  dehne  pf*(t,tf)  =  /xe>0,  which  is  an 
arbitrarily  hxed  vector  in  Ul.  Otherwise,  consider  the  set  C0t  =  {C  |  (t,  ()  G  C0(o;Ar)}  (which 
is  not  empty  due  to  [7,  Lemma  8.3])  and  choose  a  point  (*  G  Cot  such  that  ||£*  —  £||  = 
min  ||£  —  £||.  Let  fj*  =  £  —  (*.  Dehne  pf*(t,f)  =  p*,  where  (/i*, /j*  )  is  any  saddle  point 

)?eCo  t 

of  the  local  game  with  payoff  H  (p£,  £,  fj*)  dehned  by  (3.41).  The  feedback  strategy 

corresponding  to  this  sequence  of  feedback  maps  p*  =  {p£*}  is  denoted  by  F*.  In  an  analogous 
way,  using  C°(o;iV)  in  place  of  C0(aAr),  one  can  construct  the  sequence  of  feedback  maps  {p7f*} 
and  the  corresponding  feedback  strategy  T*  for  the  interrogator.  Then  by  [7,  Lemma  10.1] 
we  know  that  all  motions  vN[t;  to,  ,  F*,  rw]  and  vN[t;  to,  Vq  ,  T£,  T*]  lie  entirely  in  Co(aAr) 
and  C°(aN),  respectively.  That  is, 


u°N  [t]  t0,  Vo,  r*,  rw]  +  jN*(t,  vN[t-,  t0,  i$,  r*,  rw])  <  jN*(t0,  u(?) 


and 


v0N[t- to, itf ,  re, r*]  +  jN*(t,  v" [t- to, itf,  re, r*])  >  JN*(t0, itf ). 


,N 


tN*  : 


,N 


N 


jN*  ( 


,N\ 


Note  that  JN* (T,  vN[T ;  t0,  vf ,  Y*,  rw])  =  0  and  JN*(T ,  vN[T ;  t0,  ,  T£,  T*])  =  0.  Hence,  we 

have 


v0N [t ;  to, r:,  rw]  <  JN*[to,  v$)  <  vm[T-  to,  ,  y£,  r:], 


By  (3.18)  and  (3.19),  we  can  equivalently  write  the  above  inequalities  as 


PN  \T-  : 


N 


)  <  V 


m[T;t0,vg,re,r 


*  ■ 
oj\  * 


Thus,  the  pair  (T*,  T* )  constitutes  a  saddle  point  for  the  Nth  approximate  differential  game 
with  initial  point  (to,  Vq)  . 


4  Concluding  Remarks 


In  this  paper  a  two-player  zero-sum  dynamic  differential  game  is  considered  in  the  context 
of  electromagnetic  pursuit-evasion  games.  The  formulations  are  a  natural  extension  of  the 
corresponding  static  games  developed  in  [2]  in  the  context  of  relaxed  strategies.  The  cost 
functional  here  is  based  on  the  expected  value  of  the  intensity  of  the  reflected  signal.  We 
established  that  the  resulting  relaxed  differential  game  has  a  saddle  point,  which  is  found 
to  be  difficult  to  compute  due  to  lack  of  the  joint  continuity  of  the  cost  functional  on  the 
relaxed  controls.  To  overcome  this  difficulty,  we  then  consider  a  special  case  of  this  game  in 
which  both  players  have  only  finite  number  of  control  choices  available  at  each  time.  This 
simplified  game  is  studied  with  strategies  and  payoff  in  the  sense  of  Berkovitz  [7],  and  is  shown 
to  have  a  value  where  the  value  function  is  the  unique  viscosity  solution  of  Hamilton- Jacobi- 
Isaacs  equation  in  the  sense  of  Crandall-Lions  [13].  We  then  employ  Galerkin  approximation 
techniques  to  reduce  this  simplified  game  to  ones  in  finite  dimensional  spaces.  The  value 
functions  of  the  associated  approximate  differential  games  are  shown  to  converge  pointwise 
to  the  value  function  of  this  simplified  game.  In  addition,  we  show  the  optimal  strategies 
for  the  approximate  differential  game  are  approximate  optimal  strategies  for  the  simplified 
game.  It  is  useful  to  observe  that  the  computational  framework  presented  in  this  paper  is 
also  applicable  to  differential  games  with  state  governed  by  a  general  semilinear  evolution 
equations  such  as  those  studied  in  [21,  34], 
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